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Abstract 

CN | We study a model of information aggregation and social learning recently pro- 

posed by Jadbabaie, Sandroni, and Tahbaz-Salehi, in which individual agents try 
to learn a correct state of the world by iteratively updating their beliefs using 
private observations and beliefs of their neighbors. No individual agent's private 
signal might be informative enough to reveal the unknown state. As a result, 
agents share their beliefs with others in their social neighborhood to learn from 
each other. At every time step each agent receives a private signal, and computes 
a Bayesian posterior as an intermediate belief. The intermediate belief is then 
■ averaged with the belief of neighbors to form the individual's belief at next time 

step. We find a set of minimal sufficient conditions under which the agents will 
learn the unknown state and reach consensus on their beliefs without any assump- 
tion on the private signal structure. The key enabler is a result that shows that 
using this update, agents will eventually forecast the indefinite future correctly. 
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^ . 1 Introduction 

H 

We discuss a model of how individuals might use relevant information available to 
them to form opinions about different social, economic, and political issues. Oftentimes, 
the "best" action an agent can take is not obvious and depends on certain unknown 
parameters that determine the agent's utility function. Consider the example of an 
institution making an investment decision. The expected utility of different investment 
options depend, among other things, on the expected policies of the next administration, 
which in turn depend on how likely different political candidates are to win the election. 
One can think of the candidate who is the most likely to win as an unknown parameter 
in the investment decision of institutions. We assume that the value of the parameter 
is an unknown state of the world that is exogenous to our model and is determined at 
time zero by nature. 

Agents make relevant observations about the unknown state. Frequently, not all the 
agents make the same observations, and not all the observations are equally informative. 
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Lack of access to all the relevant information is a motivation for individuals to share their 
opinions with others in order to learn from their personal experiences. The importance of 
social interactions on opinion dynamics have been documented in several examples such 
as consumer choice, and diffusion of medical innovations and agricultural practices pp. 

We study the scenario in which each individual has two sources of information: his 
personal observations, and those of his neighbors in a social network (e.g. friends, 
neighbors, colleagues, etc.). However, agents might not have direct access to personal 
experiences of their neighbors. Instead, we assume that they can only observe their 
neighbors' beliefs, i.e., their subjective probabilities of different feasible realizations of 
the unknown state of the world. An alternative equally valid interpretation is that 
agents play a particular repeated game of imperfect information where each agent can 
only observe the actions of his neighbors, and the actions completely reveal the beliefs 
of acting agents. 

To study the effect of social networks on learning, we analyze the model in [2] in 
which agents use an update other than Bayes' rule to incorporate the new information 
available to them. Agents repeatedly interact with their neighbors and use a simple 
rule to update their beliefs. Each agent first forms the Bayesian posterior given his 
observed private signal, as an intermediate step. He then updates his belief to the 
convex combination of his Bayesian posterior and the beliefs of his neighbors. 

There are two major motivations for considering non-Bayesian protocols: The pos- 
itive point of view comes from the choice theoretic studies showing existence of non- 
Bayesian opinion dynamics among people (see [3H5] for instance). The second motiva- 
tion comes from a normative point of view. Bayesian inference in social networks can be, 
except for certain simple scenarios, computationally complicated to carry out. Part of 
the complications is because there is no reason to believe that agents know the source of 
their neighbors' information. Rather, they have to infer it to be able to form an unbiased 
belief about the true state of the world. The complexities of Bayesian updating limit its 
applicability in practice. 

In [2], the authors show that, under some assumptions, this update eventually leads to 
social learning, even in finite networks: They show that agents can eventually forecast 
the immediate future correctly. Furthermore, they will eventually learn the unknown 
state, if for each agent there exist a signal that is the more probable under the true state 
of the world than any other state. This assumption "turns the deck" in favor of learning 
by assuming that agents are infinitely often notified, indirectly, of the true state of the 
world. 

We argue through a simple example that the assumption of having a "revealing" 
signal is very limiting. We show that agents will learn the state of the world under 
the much weaker assumption that they can distinguish the state collectively. Signals 
need not be independent among agents at the same time period. Instead, we require 
the signal structure to be such that the state is identifiable given the marginals of the 
likelihood function. We first prove that not only agents will forecast the immediate 
future correctly, but also they eventually will learn to forecast the indefinite future. We 
also show that there exist a signal sequence which is informative enough to let agents 
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identify the true state of the world, even if no revealing signal exists. The results signify 
that even when none of the agents have enough information to learn the true state of the 
world, and in spite of individual signals not being revealing enough, social interaction can 
aggregate pieces of information available to the individuals such that each and everyone 
of them can distinguish the true state of the world. Furthermore, this paper suggests 
a distributed, computationally tractable algorithm for learning in networks that can be 
applied to real world problems such as parameter estimation in sensor networks (cf. [5] 
and the references therein). 

Recently there has been a rapidly growing interest in models of learning in networks. 
For a survey of different models of non-strategic and strategic social learning see pQ. 
Since in the presence of social networks fully Bayesian inference faces tractability prob- 
lems, different types of simplifications have been proposed. The first group of models 
assumes that agents interact sequentially. Examples include models in [7HTT]. In such 
models each agent, having observed the actions of some of the agents acting before, 
takes an action. Each agent makes only one decision, and cannot reverse or change 
his choice. The other group of models suggests non-Bayesian rules of thumb for belief 
update. Examples include [2^ [T2TfTS] . 

2 The Model 

The social learning model we consider, was first proposed in [2] . Time is discrete and 
there is a finite number of agents, signals, and states of the world. 

Let 6 be the finite set of possible states of the world, and 9* G 6 be the true state that 
is determined at time zero by nature, and is unchanged thereafter. Let M = {1, 2, . . . , n} 
be the set of agents. At time t > each agent i has a belief about the true state, denoted 
by Hi t t(0), which is a probability distribution over G. 

At each time period t > 1 each agent i observes a private random signal uj i)t G Si 
where Si = {sj, s?, . . . , sf*} is the set of possible signals for agent i. Conditional on 
9 G G being the state of the world, the observation profile u t = (wi,t,W2,t, ■ ■ ■ > ^n,t) is 
generated according to the likelihood function £{-\9) with £i(-\9) as its ith marginal. 
Let F e = £(-\9) N be the product measure that determines the realization of signals 
conditioned on 9 being the state of the world. This definition allows for signals to be 
correlated among agents at the same time period, but makes them independent over 
time. Without loss of generality we assume that £i(si \9*) > for all s» G S^ Si is only 
the set of signals that are realized with positive probability conditioned on the true state 
of the world being 6*. 

Let Gj = {9 G G : £i(si \ 9) = £i(si \ 9*) for all s, G Si} be the set of states that are 
observationally equivalent to the true state 9* from the point of view of agent i, and let 
G = 0i R • • • n G n be the set of states that are observationally equivalent to the true 
state of the world from the point of view of all agents. 

(f^J 7 , P) is the probability triple, where Q = ([17=1 -F * s the smallest a-field 
that makes all Ui t measurable, and P = F e is the probability distribution determining 
the realization of signals. N stands for the set of natural numbers. We use u G £1 
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to denote the infinite signal sequence (ui,u 2 , ■ ■ ■ ), and E to denote the expected value 
operator with respect to the probability measure P. Let J 7 ^ = ^(u;^!, u ij2 , ■ ■ ■ ,0Oi,t) be 
the filtration generated by the observations of agent i up to time t, and let J-" t be the 
cr-field generated by the union of all for 1 < i < n. 

We say that the adapted random variables X t and Y t are asymptotically P-almost 
surely equal, denoted by X t '= ' Y t , if there exist Q C Q such that P(f2) = 1, and for all 
cu G fl and all e > 0, there exist T(cj, e) such that for all ti, t 2 > T(cj, e), 

\X tl -Y t2 \<e. 

It is an easy exercise to show that if X t a '=' Y t and Z t a '=' W t , then X t ±Z t a '=' Y t ±W t , 
and X t Z t a =- Y t W t . 

The interactions between the agents are captured by a directed graph G = (J\f,E). 
Let Mi — {j G H : (j, i) G E} be the set of neighbors of agent i. It is assumed that 
agent i can observe the belief of agent j if there exist a directed edge from i to j, that is 
G E. A graph is called strongly connected if there exist a directed path from any 
vertex to any other one. 

Each agent i starts with the initial belief /ii,o(#) that 9 is the true state of the world. 
At the end of period t, each agent observes the beliefs of his neighbors. At the beginning 
of the next period, agent i receives the private signal uj^t+i, and then uses the following 
rule to update his belief: 

iM^m = a u ^e) it{Ul ; t+ll6 \ + E a mM, (i) 

where m^ t (si) is defined for any Sj G Si as 

m itt (si) = ^2ii(si\ 9)fi i)t (0). 
see 

In the update in equation ([T]) each agent updates his belief to a convex combination 
of his own Bayesian posterior, given only his private signal and neglecting the social 
network, and his neighbors' previous period beliefs, aij is the weight agent i assigns to 
the opinion of agent j, and an, called the self-reliance of agent i, is the weight he assigns 
to his Bayesian posterior conditional on his private signal. We assume that a^ G M + 
and YljeM-uU} a *i = ^ ^ or ^ ne beliefs to remain a probability distribution over B after 
the update. 

It is sometimes more convenient to use vector notations. We use A to denote the 
n x n matrix whose ij element is a^, and use Ht{Q) to denote the n dimensional column 
vector whose ith. element is ^(0)- 

When there is no arrival of new information, this update becomes the same as DeG- 
root's naive learning model. Likewise, when a^ = for all j ^ i, the model is the same 
as the Bayesian learning model with no network structure. 

For all t > 0, Hi,t{6) and m iyt (si) are random functions adapted to J-'t, the former 
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on and the latter on Si. m itt (si) is the probability that agent % assigns, at time t, 
to signal s» being observed in the next time step, hence, it is called agent z's one step 
forecast. We can extend this notion to define the k-step forecast ra^s^i, Sj )2 , . . . , Sj^) as 
the forecast at time t of agent i that the signal sequence (s^i, s^, • • • , Sj,fc) € will 
be realized in the next time steps 

mi,t( s i,i> • • • > s *,k) = ^2 ^( s *.i> • • • ' Si . fc l ^)^i,t(^)) 

where s ij2) . . . , Si,fc| 0) is the shorthand for ^(s^il 6 1 ) £i(s i)2 | #) • • -4(si,fc| 

The update in equation ([[]) is local in the sense that each agent only needs the beliefs 
of his immediate neighbors to compute it. 

3 Asymptotic Learning 

In this section we find a set of sufficient conditions for learning when agents use 
equation (JTJ to update their beliefs. First we have to define what we mean by learning. 
For a discussion of different notions of learning and how they are related see [19] . The 
first one we examine here is the notion of weak merging of opinions. 

Definition 1 Agent i learns the likelihood function on a sample path u>, in the sense of 
weak merging of opinions, if along that path 

m i,t(-) — > 0*) as t — > oo. 

When an agent learns the likelihood function in this sense, he will know the proba- 
bility distribution according to which signals are generated. In other words, the agent 
can forecast the immediate future correctly, as if he knows the likelihood function. 

In [2], the authors show that if agents use the update in equation (TjQ) ; learning in the 
sense of weak merging of opinions occurs under the following assumptions: 

Assumption 1 

(a) The social network is strongly connected. 

(b) All agents have strictly positive self-reliances. 

(c) There exists an agent with positive prior belief on the true parameter 6* . 

Assumption HJ(a) allows for information to flow from any agent to any other one. 
Assumption [TJ(b) is to prevent agents from disregarding their personal experiences. 
Assumption [TJ (c) is what is known as a "grain of truth" in agent's prior belief. In the 
discrete setting, this is equivalent to absolute continuity of the initial forecasts with 
respect to the likelihood function, which is often a necessary condition for learning. 
The following theorem shows that weak merging of opinions occurs when Assumptions 
Ha)-(c) hold. 
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Table 1: The likelihood function of Example Q] 





H 


T 


9* 


1/3 


2/3 


61 


1/4 


3/4 


62 


3/5 


2/5 



Theorem 1 (Jadbabaie, Sandroni, &; Tahbaz-Salehi [2]) If Assumption HI holds, 



then 



with F '-probability one. 



m,,(0 I r 



as t — > 00, 



It can also easily be shown through counterexamples that none of the assumptions 
of the theorem can be relaxed. This theorem corresponds to weak merging of opinions. 
The much stronger notion of learning is asymptotic learning of the true state of the 
world. 

Definition 2 Agent i asymptotically learns the true parameter 9* on a sample path u>, 
if along that path 

Hi,t{Q*) — * 1 as t — > 00. 

To prove asymptotic learning authors add the assumption that for any agent i, there 
exists a signal s$ G Si and a positive number Si such that 



<5,<1 V0 $ 0;. 



(2) 



This assumption asks for existence of a signal that is more likely conditioned on 9* 
being the true state of the world rather than conditioned on any other state in 0j being 
the true state of the world. Under this assumption and provided that the conditions 
of Theorem [T] hold, the authors prove that all agents asymptotically learn the true 
parameter 9* with P-probability one. The condition in fl2]) guarantees that there exist a 
revealing signal that is observed infinitely often. The following example shows that this 
is a weak assumption. 

Example 1 Consider a strongly connected graph on two agents. Assume that = 
{9*,9i,9 2 }, and S% = S 2 = {H,T}. Also assume that the private signals of agents 
are independent and are generated according to the same probability distribution £(s\ 9) 
which is given by TableUl In this example there is no signal that satisfies condition |D). 
But the signal sequence (H,T,T) is most likely under 9* rather than any other 9 G 0. 
Furthermore, (H, T, T) is the shortest such signal sequence. 

In this paper we show that to prove asymptotic learning no assumption should be 
made other than the distinguishability of the true state of the world 9*. 
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Assumption 2 (Distinguishability) There is no 9 £ t/ja£ is observationally equiv- 
alent to 9* from the point of view of all agents, that is 



This is obviously a necessary condition for the agents to learn the true state of the 
world. In Theorem O which is the main result of this section we show that it is also 
sufficient. To that end, we first show in Theorem H] that the correct forecasts of the 
agents can be extended into the future. To prove these results we first need to present 
a few preliminary results. The following theorem is a variation of Borel-Cantelli lemma 
which can be found, among many other places, as Theorem 5.3.2 in [20J. 

Theorem 2 Let Tt, t > be a filtration with To = {0, Q} and A t , t > 1 a sequence of 
events with At £ Tt- Then 



The next theorem we need is the bounded convergence theorem for conditional ex- 
pectations. For a proof see, for instance, page 263 of [20J. 

Theorem 3 Suppose Y t — > Y , F '-almost surely, and \Y t \ < M for all t where M is a 
constant. If Tt t T^ then with IF '-probability one 



The next two lemmas are technical lemmas which will be used in proof of Theorem 



e = e>! n • • • n e n = {9*}. 




E(Y t \ T) ->E(y| T^). 



Lemma 1 If Assumption [J\ holds then 



E( 



0) 

Trii,t{ui,t+i) 



T t ) — > 1 as t — > oo, 



with ¥ -probability one for all 9 £ 0. 



Proof. 




7 



By a similar argument 

M 7 r Ft) > mm r^- 

mi jt {u; i)t+1 ) sieSi m i>t {si) 

The conditional expectation is sandwiched between two quantities, both of which go to 
one P-almost surely by Theorem [XJ, and so does it. ■ 

Lemma 2 If Assumption U\ holds then 

E*(fx t+1 (9)\F t ) a =- Am. 

Proof. We take expectations of both sides of equation (pQ) conditioned on T t - Since 
fjtj,t(9) is Tt measurable for all 1 < j < n, 

Taking the limit of the above equation as t — > oo and using Lemma dj 
E{fJ,i,t+i(9)\ Ft) a '=' a u fj, ijt (9) + E aij^jA )' 

which in vector form can be written as 

E(m + i(9)\F t ) a ^ s - Af, t . 

■ 

The next theorem shows that not only the agents eventually forecast the next step 
correctly, as shown in Theorem [fl but also they do so for the next k steps for any finite 
k. 

Theorem 4 // Assumption [I] holds, then 

m itt (si,i, ■ ■ ■ , s itk ) £i(s iyU . . . , s itk \ 9*) as t -)• oo, 
with P -probability one for all s^i, Sj )2 , ■ ■ ■ , G Si. 

Proof. To simplify notation we drop the subscript i from s^i, s^, • • • , Si,* whenever 
there is no risk of confusion. We use induction on k. For k = 1 the result is proved 
in Theorem [H Multiplying both sides of equation ([I]) by Tni it (^i,t+i)^i( s 2, ■ ■ ■ , s&| #) and 
summing over 6 1 G O, 

TO M (a; M+ i) y~Vj(s 2 , • • • , s k \ 9)m,t+i{9) = fln^^Km. s 2 , ■ ■ ■ , s k \ 9)m,t{ d ) 
eee eee 

+ m it t{ui,t + i) E^( s 2, • • • ,Sk\ 9) 22 aij^jA )- 
eee jeMi 
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Thus, 



mi tt (ui tt+1 )mi ; t + i(s2, . . ■ , Sfc) = a u mi it (uJi >t+1 , s 2 ,..., s k ) 

+ mi ) t(ui,t+i)^2£i[s2,...,s k \6) ^2 

(9). 



flee j&Ni 



By Lemma [2], 



a.a.s. 



mi >t {Ui it+1 )m it t + i{S2, ■ ■ ■ ,s k ) = a ti m iit {u iit+lj s 2 , . . . , s k ) 

+ m iit (a; i)t+1 ) J^(s 2 , . . . , s*| 6')E(/j,^i + i(6') |J^; 



flee 

- fl«imi,((wi, !+ i) ^/Xj, f (0)£j(s 2 , • • • , s k \ 



Since all the terms are positive and ^(s 2 , . . . , s k \ 9) is a constant, using Fubini's theorem 
[20] we can change the order of sum and expectation to get 



mi,t( u i,t+i) m i,t+i( s 2, ■ ■ ■ , Sk) '=' aiim ijt (ujij + i, s 2 , . . . , s k ) 

+ m i , t (o;i jt+ i)E(TOj ) t + i(s2, • • • , s k )\ T t ) 
- o ii m i)t (a; i) t + i)m iit (s2 3 • • • , s k ). 

By induction hypothesis m ijt +i(s2, . . . ,s k ) converges P-almost surely to £i(s 2 , . . . ,s k \ 9*). 
Since m ijt+1 is a probability measure it is bounded for all t. Also note that T t | T . 
Hence, we can use Theorem [3] to conclude that Efm^+i^, ■ ■ ■ ,s k ) \ Tt) converges P- 
almost surely to E(^(s 2 , . . . , s k \ 9*)\ J 7 ) which is just £j(s 2 , ■ ■ ■ , s k \ 9*). Therefore, 

mi,t(ui,t+i)£i(s2, ■ ■ ■ , s k \ 9*) °"=' anmi t t(ui tt +x, s 2 , ■ ■ ■ , s k ) 

+ m i>t (uji >t+1 )£ i (s2,. ■ ■ ,s k \9*) 
- a ii m i j(u itt+1 )£ i (s 2 , ...,s k \ 9*). 

We can now solve for m itt (uj ijt+1 , s 2 , . . . , s k ) to get 

fn itt (u itt+1 , s 2 , . . • , s k ) °"=' mu(ui t t +1 )m itt (s 2 , ■■■,s k ). 

By the induction hypothesis, m^ t (s 2 , ■ ■ ■ , s k ) converges on a set of P-probability one 
to £i(s 2y ■ ■ , s k \ 9*). Moreover, m itt (u)i t t+i) is asymptotically P-almost surely equal to 
£i(uJi tt +i\ 9*). Therefore, mi t t(wi,t+i, s 2r ■ ■ , s k ) is asymptotically P-almost surely equal to 
&i(u)i t t+i, s 2 , . . . , s k \ 9*). To get the result for any arbitrary si, s 2 , . . . , s k G Si, we need to 
use Theorem [2j For any e > 0, let 



At = <uj : 



m,t-i(ui,t, s 2 , ...,s k ) _ 



(o; M ,s 2 , . . . ,s fc | 0*) 



> e 
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and for any S\ G Si, let 



B t ( Sl ) = {co: 



m ijt _i(si,s 2 , ...,s k ) 



[si, s 2 , ■ ■ ■ , s k \ 9*) 



> e 



Clearly A t is measurable with respect to Tt for all t > 1. Since asymptotically P-almost 
surely m t (u itt+1 , s 2 , . . . , s k ) is equal to £i(u ijt+1 , s 2 , . . . , s k \ 9*), F(A t i.o.) = for any 
e > 0, where i.o. is the abbreviation for infinitely often. Using Theorem [2] for the A n 
defined above, 

oo 

£>(A t |.F t _i) < oo, (3) 
t=i 

with P-probability one. We can write the summand in equation fl3]) as 

P(A t |7U) = l( J2 l^( sl )l{o, i , t = sl }|-7 r t-i), (4) 
sies 

where for any A G J 7 , 1a is a random variable defined as 



Ufa;) 



1 if a; G A, 
if a; £ A. 



1st (si) e -^t-i fo r an y s i e ^> an d u i,t is independent of J-t_i. Since all the random 
variables in equation (j3J) are positive, we can switch the order of sum and expected 
value to get 



l&S 



laical in- 



siGS 1 

If we substitute the above equation for P(A t |.F t _i) in equation ([3]) and use the fact that 
all the terms are positive to switch the order of the sums, we get that 

oo 

sxGS t=l 

with P-probability. Therefore, F(B t (si) i.o.) = for any si G and m^ t (si, s 2 , . . . , s^) 
converges P-almost surely to £i{s\, s 2 , . . . ,s k \ 9*) for any s x , s 2 , . . . , Sfe G S^. m 

This theorem shows that, eventually, an agent's forecast at time t of a time t + k 
event is not different than his forecast at time t + k — 1 of the same event. That is, when 
agents already have accurate forecasts of immediate future, they cannot improve their 
forecasts of indefinite future by observing more signals. 



10 



The following theorem uses finiteness of Si to show that for any agent i there exist 
a long enough signal sequence that is more probable under 9* than any state 9 ^ 0j. 

Theorem 5 For any agent i, there exists a finite number hi and signals s^x, §^2, ■ ■ ■ , s- £. 
such that 

£i\^i,l: Sj^, • • • , Sj f I w 



< 5i < 1 V0 £ 6 4 . (5) 



Proof. First assume that £i{si\ 9*) is a rational number for all Sj G In this case we will 
prove that we can take ki to be the least common denominator (LCD) of 9*)} Si ESi 
and (s^i, §i t 2, ■ ■ ■ , s i %..) to be a sequence of signals in which the number of occurrences of 
each Si G Si is exactly equal to the numerator of the fractional representation of £i(si \ 9*), 
when the denominator is equal to fc,. In other words, we pick the signal sequence in which 
the frequency of each signal is equal to its probability under 9*. 

Let k = LCD({£ i (s i | 9*)} s . eS .) and k\ = £(s{\ 9*% for 1 < j < M t . We prove that 
£i(-\9*) is the unique probability measure for which the probability of the signal sequence 




is maximized. As a result, for this sequence £i(- \ 9)/£i(- \ 9*) is strictly less than one for 
any 9 <£ 0j. 

Let p{ = Q(sf) for 1 < j < Mi, where Q is some probability measure on Si. To 
simplify notation we drop the subscript i whenever there is no risk of confusion. We 
solve the following concave maximization problem: 

I l\ k1 / 2\ fc2 ( M\ kM 

max (p 1 ) (p 2 ) ...(p M ) 
p 1 7—,p m (6) 

subject to p 1 + p 2 + ■ ■ ■ + p M = 1, 

■ 

where by (p 3 ) we mean p 3 to the power of k 3 . We can use a Lagrange multiplier p 
to incorporate the constraint into the cost function [21J. The resulting unconstrained 
optimization is still concave. Therefore, the optimal solutions can be found using the 
first order conditions. The resulting set of equalities are 

k 3 

- - P = 1 < j < M, 
p 3 

p 1 +p 2 + ---+p m = 1, 



which have the unique solution 

k 3 

By construction, this solution corresponds to the probability measure £i(-\9*). 
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So far we have proved the theorem for the rational case. For the case that £j(sj| 9*) 
is irrational for some s$ G Si, the proof follows from continuity of the objective of 
the optimization problem (JBJ with respect to (p 1 , p 2 , . . . jp 7 ), and the fact that rational 
numbers are dense in reals. ■ 

The condition in (jSJ) is the fc-step counterpart of the condition in (j2J). We showed 
that even though a single signal revealing the true state might not exist, but there are 
signal sequences that will do so. We are now ready to prove the main result of this 
section: 

Theorem 6 If Assumption^ holds, then 

Hi )t {6) ^ as t^oo Vi G ftf, V0 £ 0, 
with P '-probability one for all i G A/". 

Proof. First we prove that for any agent i, /ij,t(0) — > as t — > oo for all ^ 0j 
with P-probability one. Let fcj and (s^i, Sj )2 , . . . , s i £.) be a positive integer and a se- 
quence of signals respectively, that satisfy (JSJ). By Theorem HJ m^s^i, • • • , s^k) — > 
£i(si t i, • ■ ■ , s it k\ 0*) with P-probability one for any sequence of finite length. We can use 
this result for s^i, Sj j2 , . . . , s i £. to conclude that for any e > 0, there exist and a 

random variable Ti(u, e) such that P(f2i) = 1, and for any oj G f2j and t > Tj(u;, e), 



Si,l 



' • • • ' "i,** 1 



S - 0* 



< e, 



and therefore, 



Xj>«(0); 



l s i,l) • • • j S^feJ 
• • • , S^J 



< e. 



Using the result of Theorem [5] we can conclude that on any sample path oj G Qi and 

t > Ti(u,e), 

Since e > is arbitrary, this proves that /^(0) — » as £ — > oo for all Oj. 

Taking limits of equation (JTJ) as £ — >• oo and using the result proved above shows that 
J2jeAfi a i3^3,t{^) converges to zero as t — > oo and so does Hj,t{0) for all ^ Oj and j G Mi 
with P-probability one. Proceeding in the same way and using the strong connectivity 
assumption, for all j G M, p>j.t{9) — > for all 9 ^ Oj. Thus, with P-probability one 



//i )t (0) ^ as £ oo Vi G A/", V0 ^ 0. 



(7) 
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Since iM,t(') is a probability distribution over O for all i and t, we have the following 
corollary. 

Corollary 7 If Assumptions^ and\B hold, then 

IM,t{Q*) ~ * 1 as t — > oo, 
with F '-probability one for all i e A/". 

4 Conclusion and Future Direction 

We analyzed a model of social learning where the agents form their beliefs about an 
unknown state of the world by taking the convex combination of their Bayesian posterior 
and the beliefs of their neighbors. Agents will learn the true state of the world if, (a) the 
social network is strongly connected, (b) all agents have strictly positive self-reliances, 
and (c) there exists an agent with positive prior belief on the true state. Furthermore, 
none of these assumptions can be relaxed in general. We also argued that there does not 
need to exist a revealing signal for learning to happen. Rather, there are always long 
enough signal sequences that are revealing. We also proved that once the immediate 
future forecasts of the agents become approximately correct, they can be extended to 
indefinite future with a negligible error. 

We also have shown that, under the same assumptions, not only the agents eventually 
learn the true state of the world, but also they do so exponentially fast with an exponent 
that depends both on network topology and on signal structure. Therefore, even though 
network structure had a minimal effect on the possibility of learning through requiring 
strong connectivity, it influences the rate of learning in a more complicated and subtle 
way. Moreover, as a corollary of exponential learning, eventually the consensus dynamic 
becomes dominant. Hence, even if the true state is not distinguishable, agents eventually 
reach a consensus. 

As we mentioned in Section [Tj, the model of social learning where the agents exchange 
their beliefs with their neighbors can be regarded as a trivial game of imperfect infor- 
mation where the actions of the agents completely reveal their beliefs. We would like to 
extend the model to nontrivial games where the observed actions only contain limited 
information about the underlying beliefs. Then, to be able to use the simple update 
discussed in this paper, agents have to estimate the belief of their neighbors based on 
the history of their observed actions. We are interested in quantifying how, and to what 
extent unobservability of the beliefs will change the results of this paper. 
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